Project development consisted of multiple stages:
- Exploratory Data Analysis (EDA)
- Twitter User Network Graph Analysis
- Docker Implementation
- Classification of Tweets using PySpark

Note: we could not upload full_tweet_data.csv because of its size. The dataset can be downloaded from: https://afs.tools.iit.cnr.it/f/222bdfb129/